Perform first pass on input set 
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Allow only the most frequently 
occurring words to remain in the 
Hashtable 
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Perform second pass or input set 
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Add phrases made up of only 
words in Hashtable to Hashtable 
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Find the most frequently 
occurring words and phrases in 
the Hashtable 



FIGURE 1 



Remove punctuation, convert to 
lower case 



Remove stop words 



Replace words with synonyms 



Remove duplicate words 



Increment word count in Hashtable 



FIGURE 2 



Remove punctuation, convert to 
lower case 
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Remove stc 


)p words 






Replace words with synonyms 



Determine which phrases of two 
or more words are made up only of 
words that are in Hashtable 



33 



Increment the count of phrases 
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FIGURE 3 



